Active metric learning for supervised classification
نویسندگان
چکیده
Abstract Clustering and classification critically rely on distance metrics that provide meaningful comparisons between data points. To this end, learning optimal functions from data, known as metric learning, aims to facilitate supervised classification, particularly in high-dimensional spaces where visualization is challenging or infeasible. In particular, the Mahalanobis default choice due simplicity interpretability a transformation of simple Euclidean using combination rotation scaling. work, we present several novel contributions both by way formulation well solution methods. Our approach motivated agglomerative clustering with certain modifications enable natural interpretation user-defined classes clusters metric. generalizes improves upon leading methods removing reliance pre-designated “target neighbors,” “triplets,” “similarity pairs.” Starting definition generalized has second order term, propose an objective function for selection does not aim isolate each other like most previous but tries distort space minimally aggregating co-class members into local clusters. Further, formulate problem mixed-integer optimization can be solved efficiently small/medium datasets approximated larger datasets. Another salient feature our method it facilitates active recommending precise regions sample improve performance. These are indicated boundary outlier points dataset defined This targeted acquisition significantly reduce computation ensuring training completeness, representativeness, economy, which could also advantages established Deep Learning Random Forests. We demonstrate computational performance through intuitive examples, followed results real image benchmark
منابع مشابه
Active Metric Learning for Supervised Classification
Clustering and classification critically rely on distance metrics that provide meaningful comparisons between data points. We present mixedinteger optimization approaches to find optimal distance metrics that generalize the Mahalanobis metric extensively studied in the literature. Additionally, we generalize and improve upon leading methods by removing reliance on pre-designated “target neighbo...
متن کاملSemi-supervised Distance Metric Learning for Visual Object Classification
This paper describes a semi-supervised distance metric learning algorithm which uses pairwise equivalence (similarity and dissimilarity) constraints to discover the desired groups within high-dimensional data. As opposed to the traditional full rank distance metric learning algorithms, the proposed method can learn nonsquare projection matrices that yield low rank distance metrics. This brings ...
متن کاملSupervised Learning for Classification
Supervised local tangent space alignment is proposed for data classification in this paper. It is an extension of local tangent space alignment, for short, LTSA, from unsupervised to supervised learning. Supervised LTSA is a supervised dimension reduction method. It make use of the class membership of each data to be trained in the case of multiple classes, to improve the quality of classificat...
متن کاملSemi-Supervised Active Learning for Sound Classification in Hybrid Learning Environments
Coping with scarcity of labeled data is a common problem in sound classification tasks. Approaches for classifying sounds are commonly based on supervised learning algorithms, which require labeled data which is often scarce and leads to models that do not generalize well. In this paper, we make an efficient combination of confidence-based Active Learning and Self-Training with the aim of minim...
متن کاملSemi-Supervised Metric Learning Using Pairwise Constraints
Distance metric has an important role in many machine learning algorithms. Recently, metric learning for semi-supervised algorithms has received much attention. For semi-supervised clustering, usually a set of pairwise similarity and dissimilarity constraints is provided as supervisory information. Until now, various metric learning methods utilizing pairwise constraints have been proposed. The...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computers & Chemical Engineering
سال: 2021
ISSN: ['1873-4375', '0098-1354']
DOI: https://doi.org/10.1016/j.compchemeng.2020.107132